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Abstract. We consider the problem of approximation of matrix functions of 
class LP on the unit circle by matrix functions analytic in the unit disk in the 
norm of L^, 2 < p < oo. For an m x n matrix function $ in L^, we consider 
the Hankel operator : iJ«(C") ^ H^iC""), 1/p+l/q = 1/2. It turns out 
that the space of to x n matrix functions in splits into two subclasses: the 
set of respectable matrix functions and the set of weird matrix functions. If $ 
is respectable, then its distance to the set of analytic matrix functions is equal 
to the norm of H^. For weird matrix functions, to obtain the distance formula, 
we consider Hankel operators defined on spaces of matrix functions. We also 
describe the set of p-badly approximable matrix functions in terms of special 
factorizations and give a parametrization formula for all best analytic approx- 
imants in the norm of L^. Finally, we introduce the notion of p-superoptimal 
approximation and prove the uniqueness of a p-superoptimal approximant for 
rational matrix functions. 

1. Introduction 

The classical problem of analytic approximation of functions on the unit circle 
T is for a given function ip G L°°, to find a best approximant to ip, i.e., to find 
a bounded analytic function in the unit disk D such that 

llv^ - V'lU-m = distLcx,((y9,if°°). 

A standard compactness argument shows that such a best approximant always 
exists, though it is not necessarily unique in general. However, under certain mild 
assumptions the best approximation is indeed unique. For example, this happens 
if ip is continuous which was proved for the first time in |Kha] . We refer the reader 
to [Pel] for a comprehensive study of the problem of best uniform approximation 
by analytic functions. 

It turns out that this approximation problem is closely related to Hankel oper- 
ators on the Hardy class H^. For a function (p G L°° the Hankel operator 
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is defined by 

where P_ is the orthogonal projection from onto H^. It was proved by Nehari 
(see [M], Ch. 1, §1) that 

IliJ^II =distLoo((^,ff°°). 

Moreover, it turns out that the Hankel operators provide a powerful tool to con- 
structively study the problem of best uniform analytic approximation, see Chapters 
1, 5, and 7 of [Pel] . The problem of uniform approximation by analytic functions 
is also called the Nehari problem. 

The Nehari problem is very important in applications in control theory (see [F] 
and [Pel] ) and is also a useful tool in identification, see |Paj and |BLPT] . Moreover, 
for the needs of control theory it is important to consider not only the scalar case, 
but also the case of matrix-valued functions. 

Let $ be a bounded function with values in the space „ of m x n matrices 
(notationally, $ G L°^(M.m,n))- The problem of best analytic approximation is to 
find a bounded analytic matrix function Q of size m x n such that 

ll-^" -QI|l-(M„,„) = distioo {^,H^{Mm,n)), 

where if°°(Mm,n) is the space of bounded analytic mxn matrix functions and for 
a matrix function \I' G L°°(Mm „) we use the notation 

II^IIl^ =^ esssup ||^'(C)||m^,„, 

where for a matrix A in M^.n we denote by ||^||m,„.„ the operator norm of A as 
an operator from to C". 

As in the scalar case, the following distance formula holds: 

distioo ($,ff-(M^,„)) = \\H^l $ G L°°(M^,0, 

where the Hankel operator : H^{€^) Hl{C^) = L'^{V^) Q H^iC^) is 
defined by 

and P_ is the orthogonal projection onto H'^{C^) (see, e.g., [Pel] , Ch. 2). 

However, unlike the scalar case, even if $ is a polynomial matrix function, 
generically $ has infinitely many best approximants. To choose among all best 
approximants the "very best approximant" , it is natural to consider the notion 
of superoptimal approximation. We refer the reader to § 2 of this paper for the 
definition of superoptimal approximation. 

In this paper we are going to consider the problem of analytic approximation in 
the norm, 2 < p < oo. 
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Let if he a. scalar function in L^. The problem of best analytic approximation 
is to find a function ip in the Hardy class such that 

\\f-i'\\LP = dist LP {<^, HP). 

If 1 < p < oo, then the space is uniformly convex which implies that every 
function if & has a unique best analytic approximant in the L^. The function 
i(j is said to be the p-best analytic approximant to (p. 

In |BS] Hankel operators have been used to study the problem of best analytic 
and meromorphic approximation in for 2 < p < oo (see also |Pr] for a dual 
approach). For ip E the Hankel operator 

is defined by 

HJ = F.pf, feH\ 
where the exponent q satisfies the equality 

i + i^. (1.1) 

p q I 

Throughout this paper we always assume that 2 < p < oo and q satisfies ( 11.11) . In 
the proofs of the results we assume that 2 < p < oo, it is an elementary exercise 
to adjust the proofs for p = 2. 
As in the case of uniform analytic approximation, the following formula holds 

\\H^\\h<i-,hI = distLp{p,HP) 

In § 2 of this paper we discuss in more detail the problem of best analytic approx- 
imation by scalar analytic functions in L^. 

In this paper we deal with the problem of approximation in by analytic matrix 
functions: given a function $ in L^(Mm,n) (i-e., all entries of $ belong to L^), we 
search for a best analytic approximant Q G if^(Mm,n), i-e., 

||$-Q|Up = distiP ($,iJP(M^,„)), 
where for a matrix function \I' G L^(Mm ,i), 



I^IIl. = ||^||l.(m„.„) = ||*(C)||^„,„rfm(C)) 



1/p 



If we consider the Hankel operator 

H^ : H%(r) Hl{U 

defined by 

it is easy to verify that 



H^f = p_$/, / G 



\H^\\ <distiP ($, i7^'(M^,„)) 
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(see Lemma IT3|) . It will be shown in § 4 that if <l> has a p-best analytic approximant 
Q such that for in a subset of T of positive measure, the space of maximizing 
vectors of ($ — Q){C) is one-dimensional, then 

\\H^\\=distLv{^,HPiM^,n)). (1.2) 

Clearly, generically for an m x n matrix A, the maximizing vectors of A span a 
one-dimensional subspace. 

This makes it plausible that for a dense subset of matrix functions $ in LP(Mm 
the distance formula (11.21) holds which would imply that (II. 2p holds for all matrix 
functions $ in U'{Mrn,n)- 

Surprisingly, this is false! 

In § 3 of this paper we obtain certain factorization theorems for analytic matrix 
functions that will be used to study Hankel operators. The main tool used in § 3 
is Sarason's factorization theorem |Sa] . 

In §4 we study the class of matrix functions $ G L°°(Mm,n); for which the 
distance formula (11.21) holds. Such matrix functions are called respectable. We 
obtain several characterizations of the class of respectable matrix functions. 

The main result of § 5 is a construction of a 2 x 2 matrix function $, for which 
(II. 2p is false. Such matrix functions are called weird. 

Thus the space LP(Mm,n) splits in two subsets: the set of respectable matrix 
functions and the set of weird matrix functions. To compute the distance from a 
respectable matrix function $ to the set of analytic matrix functions, we can use 
the distance formula (II. 2p . However, to compute dist^p ifP(Mm,n)) for weird 
matrix functions $, we have to search for another formula. Note that in a sense 
both the set of respectable matrix functions and the set of weird matrix functions 
are massive subsets of L^(Mm,n); see the discussion at the end of §5. 

It turns out, however, that the distance dist^p ($, HP(M.m,n)) from $ to the set 
of analytic matrix functions can be obtained for all matrix functions in as the 
norm of a Hankel operator if we consider Hankel operators acting on spaces of 
matrix functions rather than vector functions. Indeed, If we consider the Hankel 
operator 73"$ defined on the space H'^{S2) of nxn matrix functions with the norm 

Then the norm of the Hankel operator 

is equal to dist^p ($, if^(Mm,n)). Here for an n x /c matrix A the norm ||y4||gn.fe 

is the Hilbert-Schmidt norm of A and H^Hsj *= ll^llsj-"- This will be proved in 
§ 6. We also consider in § 6 Hankel operators acting on spaces oi n x k matrix 
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functions and we introduce in § 6 the class of n x n matrix functions in of order 
k, 1 < k < n. 

In § 7 we obtain a description of the set of p-badly approximable matrix func- 
tions. A matrix function $ G L^(Mm,n) is called p-badly approximable if 

||$|Up = distLP {^,HP{Mm,n)). 

To obtain such a description, we use special factorizations that involve balanced 
matrix functions (see |Pelj . Ch. 14, § 1). 

We also obtain in § 7 a parametrization formula for all p-best approximants. 

In the last section we define the notion of p-superoptimal approximation and 
prove for rational matrix functions the uniqueness of a ]3-superoptimal approxi- 
mant. 

In § 2 we collect necessary information. In § 2.1 we present results on analytic ap- 
proximation in of scalar functions. In § 2.2 we define the notion of superoptimal 
approximation and state some uniqueness results and properties of superoptimal 
approximants. Finally, in §2.3 we define the notion of balanced matrix functions 
and state factorization formulas for badly approximable matrix function. 

Note that it suffices to study the problem of analytic approximation only for 
square matrix functions. Indeed, if a matrix function $ is not square, we can 
add to $ zero columns or zero rows to make it square. For the sake of simplicity, 
beginning §6, we state all the results only for square matrix functions. 

Notation and terminology. Throughout the paper we are going to use the 
following notation and terminology: 

if X and Y are normed spaces and T : X —>■ Y is a. bounded linear operator, a 
vector X G X is called a maximizing vector ofT if 

X 7^ and ||Tx||y = ||T|| • 

if both X and Y are Hilbert spaces and T is a bounded linear operator from X 
to Y, then, by definition, the space of maximizing vectors ofT is 

{x E X : ||Tx||y = ||T|| ■ 

(it is well known that the space of maximizing vectors is a closed subspace of X 
that consists of the maximizing vectors and the zero vector); 
M.m,n is the space of m x n matrices; 

M„ = M„,„; 

if X is a normed space of functions on T, then X(Mm,n) means the space of 
m X n matrix functions whose entries belong to X. If this does not lead to a 
confusion, we say that $ G X for an m x n matrix function $ if $ G X(Mm „); 

if X = 1 < s < oo, and $ G X(M^,„), then 

II^IU = ||$IU{M„.„) = IIpIU, where p(C) = ||$(C)||m^,„, C e T; 
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for 1 < p < oo, the space HP(M.m,n) is the subspace of L^'(Mm,n) that consists 
of matrix functions with entries in H^. By definition, 

ifo^(M„,„) = {Fe L^iM^^n) ■■ F{0) = 0} ; 

for an operator A on Hilbert space (or for a matrix A), the singular values Sj{A) 
are defined by 

Sj{A) = mf{\\A - K\\ : rankisT < j}; 

the Schatten-von Neumann class S,., 1 < r < oo, consists of operators A on 
Hilbert space with finite norm 




for r G [1, oo), we denote by S"^'^ the space of m x n matrices A equipped with 
the Schatten-von Neumann norm (11 .Sp : 

5n def c<n,n. 
r — ' 

ifX = L'', l<s<oo, then X(S'™'") is the space of m x n matrix functions 
with entries in X equipped with the norm 

ll'^'llx(sr-") =^ IIpIU, where p{() = \\<^{()\\s^,n, (eJ. 
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2. Preliminaries 

2.1. Analytic approximation by scalar functions in L'p, 2 < p < oo. 
The problem of analytic approximation in was studied by many mathematicians, 
see, e.g., |Sh] and |Ka]. As we have already mentioned in the introduction, in |BS] 
to study the problem of best analytic approximation in L^, Hankel operators from 
H'^ to were used, where the exponent q satisfies (11.11) (see also [Prj in which a 
similar approach is used) . The approach of |BS] and |Prj is based on the analog of 
Nehari's theorem: 

WHJho-.h^, = distip((^, HP), if e LP. (2.1) 

Moreover, it can be shown that if G L^, then the Hankel operator defined on 
the set of analytic polynomials by the formula 
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extends to a bounded operator from to H"^ if and only if P_(y9 G L^. This can 
be proved in exactly the same way as in the case of classical Hankel operators from 
to (see, e.g., |Pel] . Ch. 1, § 1). In particular, this implies that all bounded 
Hankel operators from to are compact, since the trigonometric polynomials 
are dense in and has finite rank if is a trigonometric polynomial. 
A scalar function (yj e is called p-badly approximable if 

\\v - ^IIlp > IIv^IIlp 

for any ip G H^. 

The following result describes the class of all p-badly approximable functions. 

Theorem 2.1. Let (f be a nonzero function in W . Then ip is p-badly approx- 
imable if and only if there exists an inner function d and an outer function h in 
such that 

^ = = (2-2) 

Proof. Suppose that tp is p-badly approximable. Let / G be a maximizing 
vector of H^. Such a vector exists, since if^ is compact. We have 

\\H^f\\L^ = ||P-(/./|U2 < ll^/IU. 

<||v9|U.||/||M = ||i^^||-||/IU. = ||^^/||l^, 

since / is a maximizing vector. Thus all inequalities in the above chain are equal- 
ities. The fact that the first inequality turns into equality means that (pf G H^. 
The second inequality turns into equality if and only if = c\f\'^ for some c > 0. 
We can multiply / by a constant after which c becomes equal to 1. Let h be 
an outer function in such that \h\ = |/|'^''^. Then / admits a factorization 
/ = 'dih'^^'i, where is an inner function. 

Put g = zH^f G H'^. We have \g\'^ = Iv^/P = Let g = ■&2h, where ^2 is an 
inner function. Then 

^ = 7 = ^^^^^/^- 

It remains to put {} = ■i?i'i?2- 

Suppose now that (p is of the form ({22]) • Put / = h^^i. We have 

lli^^ll • II/IIl. > WHJWl^ = \\h\\L2 = II^IUHI/IU. > lli^^ll • II/IIl.. 

Thus 1 1 1 1 LP = ||-f^<p||, and so p is p-badly approximable. ■ 

Remark. Note that in the case p = 00 the situation is slightly different. A 

bounded Hankel operators from to is not necessarily compact and does not 
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necessarily have a maximizing vector. A badly approximable function ip has the 
form 

if = czd—, 
h 

where c G C, "i? is an inner function, and h is an outer function in H^, if and only 
if the Hankel operator : — > has a maximizing vector, see [Pel] , Ch. 1, 

§1- 

In the case p = 2, Theorem 12.11 means that the 2-badly approximable functions 
are precisely the functions in and a function / G H°° is a maximizing vector 
of the Hankel operator if^ : with a 2-badly approximable symbol ip if 

and only if f = cd, where c is a nonzero complex number and t? is an inner divisor 
of zp. 

Corollary 2.2. Let be a nonnegative function iuL^. The following are equiv- 
alent: 

(i) there exists a p-badly approximable function p & V such that \p\ = 0; 

(ii) log0 G L^. 

Proof. The implication (i)^(ii) is an immediate consequence of Theorem 12.11 
and the fact that the logarithm of the modulus of any outer function is in L^. 

Conversely, suppose that log0 G L^. Let h be an outer function such that 
\h\ = 0?!"^. Clearly, h G . Let p = Zj^. By Theorem 12. H p is badly approx- 
imable. We have 



\h\'-y'i = \h\^/p = 0. m 

Corollary 2.3. Let be a nonnegative function in L^ such that log0 G L^ and 
let h be an outer function such that \ h\ = 0^/^. Then the set of p-badly approximable 
functions with modulus coincides with 



l^^/^iTg ■ ^ is an inner function^ . 



Proof. The result follows immediately from Theorem 12.11 ■ 

2.2. Superoptimal approximation. As we have already mentioned in the 
introduction, even for polynomial matrix functions $ there can be many best 

analytic approximants in the norm. For instance, if $ = and 
^ ^ where / is a scalar function in the unit ball of then F is a 



.0 / 

best approximant to $. 

To introduce the notion of superoptimal approximation, recall the notion of 
singular values of matrices. For a matrix A the jth singular value of A is defined 
by 

Sj{A) = inf{P - K\\ : rankiT < j}, j > 0. 



Clearly, so(A) = ||A||. 

Definition. Given a matrix function $ G L°°(Mm,n) we define inductively the 
sets 0j, < j < min{m, n} — 1, by 

00= {qe if°°(M^,„) : Q minimizes to = esssup ||$(C) - Q(C)II j ; 

I C6T J 

0j = I Q E 0J-1 : Q minimizes tj =^ esssup Sj($(C) — Q{C)) f ; J > 0. 
Functions in 0k = 0min{m,n}-i are called superoptimal approximants to $ by 

fc>0 

bounded analytic matrix functions. The numbers tj = tj{^) are called the super- 
optimal singular values of $. Note that the matrix functions in 0q are just the 
best approximants by analytic matrix functions. 

In other words, a superoptimal approximant minimizes the essential suprema of 
the singular values of ($ — Q){C) lexicographically. 

It was proved in [Py] that if $ e {H°° + C)(Mm,n) (i-e., each entry of $ is a sum 
of a a continuous function and an H°° function), then $ has a unique superoptimal 
approximant. Moreover, if Q is the unique superoptimal approximant to $, then 

s,((<i>-g)(c)) =t„ cgt. 

Later in [PTl] the same results were obtained under a less restrictive assumption 
on $. We refer the reader to |Pelj . Ch. 14 for a detailed presentation of the theory 
of superoptimal approximation. 

2.3. Balanced matrix functions and factorizations of badly approx- 
imable matrix functions. A matrix function $ in L°°{M.m,n) is called badly 
approximable if 

II'^'IIl- < ll'^' - Q\\l-° 

for any Q G H°-{Mm,n). 

A matrix function $ is called very badly approximable if the zero matrix function 
is a superoptimal approximant to $. 

In |PY] and [AP] the set of badly approximable matrix functions of class 
{H°° + C)(Mm,n) was described in terms of certain special factorizations (see also 
|PT2] in which a geometric description of very badly approximable matrix functions 
was obtained). Such factorizations involve certain special unitary- valued matrix 
functions (balanced matrix functions), see |Pelj . Ch. 14, §1. To define balanced 
matrix functions, we have to introduce several notions. 

A matrix function G if°°(M„ fc) is called inner if on the unit circle G*B = Ik, 
where Ik is the matrix function identically equal to the identity matrix 7^.. 

A matrix function F G if^(Mm,n) is called outer ii the set 

{Ff : / is a polynomial in //^(C")} 
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is dense in if^(C™). 

Finally, a matrix function F G if^(Mm,n) is called co-outer if the transposed 
function is outer. 

It is well known (see, e.g., [N] or |SFj ) that if is a matrix function of class if^, 
then $ admits an inner-outer factorization 



where B is an inner matrix function and F is an outer matrix function. 

Let k < n and let T be an n x inner and co-outer matrix function. It is 
well known (see [Pel] , Ch. 14, §1 and [H], Ch. 9) that there exists an inner and 
co-outer matrix function of size n x [n — k) such that the matrix function 



takes unitary values or, in other words, is unitary-valued. Matrix functions of the 
form (12. 3p are called balanced matrix functions. If we want to specify that the 
analytic part of V has k columns, we say that V is a k-halanced matrix function. 
In the case = 1, fc-balanced matrix functions are also called thematic matrix 
functions. If A; = n by a fc-balanced matrix function, we mean a matrix function 
of the form r/„, where r is a complex number of modulus 1. 

Balanced matrix functions have many interesting properties, see [Pel] , Ch. 14, 
§ 1. They have been used to obtain a description of badly approximable matrix 
functions, to parametrize the set of best analytic approximants, to characterize 
very badly approximable matrix functions, to prove the uniqueness of superoptimal 
approximants, and to construct the superoptimal approximant (see, |PYj . |PTlj . 
[AF] . and [M], Ch. 14). 

In particular, it was shown in |PYj (see also [Pel] , Ch. 14, §2) that if $ is 
a matrix function in L°°(Mm,n) such that the Hankel operator 
: if^(C") — » /7i(C™') has a maximizing vector, then $ is badly approximable 
if and only if $ admits a factorization 



where V and are thematic matrix functions, is a scalar inner function, /i is a 
scalar outer function in if^, and is a matrix function of size (m — 1) x {n — 1) 
such that ||$#(C)||m™,_i,„_i < 1 almost everywhere on T. 

Another characterization of badly approximable functions was obtained in [APj 
(see also [PeT] . Ch. 14, § 15). Let $ G [h°° + C)(M^,„) and let k be the number 
of superoptimal singular values tj($) equal to to('^) (in other words, k is the mul- 
tiplicity of the superoptimal singular value to(^))- Then $ is badly approximable 
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V = (T 0) 



(2.3) 




if and only if 

(U \ 

$ = \\H^\\W* V*, 

V ; ' 

where U is a. k x k very badly approximable unitary-valued function of class 
H°° + C, V and W are /c-balanced matrix functions, and is a matrix func- 
tion in {H°° + C){Mm-k,n-k) such that ||$#(C) IIm^.^.^.^ < 1 almost everywhere on 
T and \\Hq,^\\ < 1. Actually, the condition $ G + C)(Mm_„) can be relaxed 
(see |AF] and [PeT], Ch. 14, § 15). 



3. Factorization of analytic matrix functions 

In this section we obtain several factorization theorems for analytic matrix func- 
tions that will be used to study Hankel operators. 

We are going to use the following result by D. Sarason that is an analog of Riesz 
factorization: 

Sarason's Theorem [Sa]. Let Ti he a separable Hilhert space and let ^ he an 

analytic integrahle BiTi) -valued function on T. Then there exist analytic square 
integrahle functions Q and TZ such that 

^ = Q7^, 7^*7^ = (^*^)^^^ and Q*Q = mZ* a.e. onT. (3.1) 

The following theorem can be deduced easily from Sarason's theorem. Recall 
that 2 < p < oo and q satisfies (11. ip : as usual, p' is the dual exponent: 1/p+l/p' = 
1. 

Theorem 3.1. Let ^ G ifP'(M„). Then there exist matrix functions 
F G Hi(Mn) and G G i/^(M„) such that 

m = FG and m\Lp'{s,) = \\F\\L'^{s,)\\G\\ms,)- 

Proof of Theorem 13. 11 Clearly, we may assume that is a nonzero function. 
Suppose that Q and IZ satisfy the requirements of Sarason's theorem. Let /i be a 
scalar outer function such that 

IMC)l = ll^(C)llsf''^ Cel. (3.2) 

Put 

F = hQ and G = \n. 

h 
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By Sarason's Theorem, 

= f mow's f'''^'^' dmiC) 

JT 
JT 

Similarly, 

JT 

mOWi^'^' dm{C) 

J li- 
lt follows that 

We need a version of Theorem 13.11 in the case ^E'(C) has rank /c for C G T. The 
following result can be deduced from Sarason's theorem. 

Theorem 3.2. Let 1 < k < n and let ^ fee a function in H^(M.n) such that 

rank\E'((^) = k on a subset ofT of positive measure (3-3) 

Then there exist matrix functions T G if^(M„^fc) and Q G if ^(Mjt^„) such that 

^ = TQ and ||^(C)l|s? = ll^(C)ll5-'^-|l^(C)ll5,^.'M CeT. (3.4) 

Proof. Clearly, each minor of ^ belongs to the Hardy class for some s > 0. 
It follows now from the uniqueness theorem for Hardy classes that condition (13.31) 
is equivalent to the fact that rank\E'(C) = k almost everywhere on T. 

Let Q and TZ he n x n matrix functions satisfying the requirements of Sarason's 
theorem. Then 

||^(C)lls^ = IIQ(C)ll55l|7^(C)ll5^ (3.5) 

We need the following elementary lemma whose proof is given here for complete- 
ness. 

Lemma 3.3. If A and B are operators on Hilbert space, rankAi? = k, and 
\\AB\\sj^ = \\A\\s2\\B\\s2> then rankA = rankS = k. 
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Let us first complete the proof of Tlieorem 13.21 
By Lemma [3.31 fl3.5l) implies that 

rank = k and rank7?.(C) = k for almost all C ^ I'- (3-6) 

Consider the inner-outer factorization of TZ: 

n = Tg, 

where T is an inner matrix function and Q is an outer matrix function. It follows 
from (13.61) that T has size n x k and Q has size k x n. We can define now the 
function by JF = QT. Since T takes isometric values almost everywhere on T, 
it follows that 

11-^(0115-'= = 112(0115^ and ||^(C)ll5^" = l|7^(C)l|5^ 

and so ([MD holds. ■ 

Proof of Lemma 13.31 Clearly, if rank A < k or ranki? < k, then rank AB < k. 
Suppose now that the conclusion of the lemma is false. Without loss of general- 
ity we may assume that rank A > k. Let P be the orthogonal projection onto 
Rangers. Then AB = PAB. Clearly, \\A\\%^ = ||PA|||^ + ||(/-P)A|||^. Since 
rankP = k and rankA > k, it follows that ||PA||s2 < ||^||s2- Thus 

\\AB\\s, = \\PAB\\s, < ||PA||s,||P||5, < \\A\\s,\\B\\s, = \\AB\\s, 

and we get a contradiction. ■ 

We need the following consequence of Theorem 13.21 

Theorem 3.4. Let ^ e Hp' (Mn) such that 

rank^(C) = A;, (eT. 
Then there exist matrix functions F G H'^(M.n^k) G G if^(Mfc „) such that 
= FG and W^Wlv'^s'^) = \\^\\li{s^,''')\\^\\l^s'^'")- 
Proof. As in the proof of Theorem 13. ![ we put 

F = hJ^, G = \q, 
h 

where T and Q are matrix functions satisfying the requirements of Theorem 13.21 
and /i is a scalar outer function satisfying (13. 2p . The fact that F and G satisfy the 
conclusions of the theorem is exactly the same as in the proof of Theorem 13.21 ■ 
The case of matrix functions of rank 1 is of special interest. We treat this case 
separately, without using Sarason's theorem. 

Lemma 3.5. Let ^ G //^(Mm,„) such that 

rank\E'((^) = 1 on a subset ofT of positive measure (3.T) 
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Then there exist vector functions u G H'^{C"^), and v G if^(C"), such that 

*(C) = n(Cy(C), CeT. (3.8) 

and 

\HO\\c- = HC)\\c'^ = \\^iO\\lll„. (3.9) 

Proof. Condition (13. 7p means that each 2x2 minor of \1/ vanishes on a set of 
positive measure. Since \E' G if^(Mm „), it follows that all 2 x 2 minors of \E' are 
identically equal to zero. Thus condition (13.71) implies that rank\l'((^) = 1 almost 
everywhere on T. 

Let h be an outer function such that 

IMC)P = II^(C)IIlmm„,„), Cgt, 

and let G = h~^^! . Clearly, G G H'^{Mm,n)- Consider the columns of G. Let C be 
the invariant subspace of multiplication by z on if^(C™) spanned by the columns 
of G. By the Beurling-Lax theorem (see [N]), there exists an inner function T of 
size m X k such that C = TH'^{C''). Since rankG^O = 1 almost everywhere, it 
follows that k = 1. Then there exist functions Ui , f 2 , ■ ■ ■ , f „ such that the columns 
of the matrix function G are f iT, f 2T, ■ • • , u„T. Let 

\vn J 

Clearly, v G H'^{C") and G = Tf*. It remains to put u = hT and observe that 
u G //^(c™) and both ([33]) and (EH) hold. ■ 

Theorem 3.6. Let \E' fee a rank one matrix function in if^'(Mm,n)- Then there 
exist column functions f G H'^{C™') and g G i/^(C") S'uc/i that 

m = fg^ and ||^||lp'(m^ = ||/||l9(c-) IklU^CC")- (3.10) 

Proof. Let u and f be the column functions satisfying ( 13. Sp and ( 13. 9p . Let 
be a scalar outer function satisfying (13. 2p . Put 

f = hu and (7 = — u. 

It is easy to verify that / G //"(C™), ^ G H'^{C'^), and the equalities in (l3A0il hold. 
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4. Respectable matrix functions 



The main result of this section is Theorem 14.41 which gives us several characteri- 
zations of the set of matrix functions $ e L^(Mm „), for which dist^p ($, HP(M.m,n)) 
is equal to the norm of the Hankel operator : if^(C") — > H^{C"^). The de- 
scription of this class of matrix functions (such matrix functions will be called 
respectable) makes it very natural to hope that all matrix functions in LP(Mm,n) 
are respectable. However, it will be shown in § 5 that this is not true. 

Definition. A matrix function <l> G LP(Mm,n) \ HP{M.m^n) is called regularly 
approximable if there exists a best approximant Q G H^(M.m^n) such that the 
space of maximizing vectors of ($ — Q){C) is one-dimensional on a subset of T of 
positive measure. 

It follows from the Hahn-Banach theorem that for $ G LP(M.m,n), 



distLP {^,HP{Mm,n)) =SUp 



trace (<I>(C)^(C)) d'm{C) 



where the supremum is taken over all \Ef G Hq (M„_m) (i-e., G HP'{Wln,m) and 
^(0) = 0) such that < 1. 

Since the space L^{Wlm,n) is reflexive, it follows that for a matrix function 
$ G LP{M.m,n) \ HP{M.m,n) there exists a matrix function ^ G i^o'(M„_m) such 
that 

II*IIlp'(s"-) = 1 and / trace ($(0^(0) f^"^(C) = distip($,^^(M„,0). (4.1) 

Jt 

Such a function \1' is called a dual extremal function of $. 

Recall that for a matrix function $ G {WS.rn,n) , we consider the Hankel operator 
: HI {C") ^ H^iC"") defined hj 

where 1/p + 1/q = 1/2. 

As we have mentioned in § 2, for Hankel operators with scalar symbols, for- 
mula fl2.ll) holds. Thus it is easy to see that the norm of the Hankel operator 
: //"(C") Hl^C"") is equivalent to the distance in Lp from $ to //p(M^,„). 
Since in the case of scalar symbols all bounded Hankel operators from H'^ to 
are compact, we can obtain the following result. 

Lemma 4.1. For an arbitrary matrix function $ in LP(Mm_„), the Hankel op- 
erator H^ : if''(C") H^^C"") IS compact. 

Corollary 4.2. Let $ G LP(Mm,n)- Then /ias a maximizing vector in 
H'i{C^). 
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The following lemma gives us an upper estimate for the norm of 

Lemma 4.3. Let $ G Lp(M„,„). Then 

\\H4 <dist,.p {^,HP{M^^^)). 

Proof. Since H<!,-q = His, for an arbitrary Q in H^{M.m,n), it suffices to prove 
the inequahty 

m,n J ■ 

Suppose that / G Hi{C") and g G Hl{C^). We have by Hdlder's inequality, 
\{H^f,g)\ = \{^f,g)\< [ \^fg*\dm 

JT 

< ll*(C)||^_ c^m(C)) 11/(011?:.) 11^(011^™) 

= ll'^l|LP(M,„,„)||/|U9(C")llfi'IU2(C")- ■ 

The following theorem gives us several characterizations of the class of matrix 
functions $, for which \\H^\\ = dist^p ($, ifP(Mm,„)) . 

Theorem 4.4. Let $ G LP(Mm,n) \ -f^^(Mm,n)- The following are equivalent: 

(i) \\H4=distLp{^,HP(M^^^)); 

(ii) $ belongs to the closure of the set of regularly approximable functions in L^; 

(iii) $ has a dual extremal function such that rank\E'(C) = 1 on a set of 

positive measure; 

(iv) $ has a dual extremal function \E' such that rank\l/(0 = 1, G T; 

(v) if Q is a best approximant to $, then $ — Q admits a factorization 

$_g = H^* ^ h/*, (4.2) 

where V and are thematic matrix functions, d is a scalar inner function, h is 
a scalar outer function in LL^ , and is an (m — 1) x (n — 1) matrix function such 

that ||$#(C)||m_,,„_, <IMC)P^CeT. 

Note that in fl4.2p the outer function h must satisfy the equality 

|MC)r/^ = ||(<f-g)(C)||M_, CeT. 

Remark. Since the set of matrices, for which the space of maximizing vectors 
is one-dimensional is dense in the space of matrices, this suggests a hope that the 
set of regularly approximable mx n matrix functions is dense in L^(Mm„). If this 
were true, then the distance formula \\LL^\\ = dist^p ($, -ff^(Mm,n)) would hold for 
an arbitrary matrix functions in L^(Mm,n)- Surprisingly, we will show m § 5 that 
this is not the case. 
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Definition. Matrix functions $ G LP(Mm,r!,) \ HP(^m,n) satisfying one of the 
conditions (i)-(v) in the statement of Theorem 14.41 are called respectable matrix 
functions. If a matrix function $ G L^{M.m,n) \ H^{^m,n) is not respectable, it is 
called a weird function. 



It follows immediately from Theorem 14.41 that the set of respectable functions is 
closed in L^, while the set of weird functions is open. 



Proof of Theorem 14.41 We start with the proof of the implication (iv)^(i). 
Let \I' be a dual extremal function such that rank^((^) = 1, C G T. Then \l/ 
satisfies fHll) . 

Since is always less than or equal to distLp($, HP(Mm,n)) , we have to show 

that 

\\H^\\ >distip($,i/f(M^,„)). 
By Theorem 13.61 there exist functions / G i/^(C") and g G Hq{C^) such that 
^ = fg' and 1 = ||^|Lp'(m^,„) = 

Without loss of generality we may assume that 
have 



L9(C™) \\g\\L^{C")- 
L9(C") = ||5'||l2(C'") 



1. We 



\H4 > \{H^f,g*) 



trace dm 



trace($/(7*) dm 



distiP ($,ifP(M^,„)). 



trace ($\I') dm 



Next, let us show that (i)^(v). Let / G H'^{C^) be a maximizing vector of Hq, 
and let Q G H^{M.m,n) be a best approximant to $. We have 

||^^$/IU2(c-) = \\H^-Qf\\LHa^) = W^-C^ - Q)f\\LHc"^) < IK* - (5)/||l2(c-) 

< ||($ - Q)||lp(M™,„)||/||//'!(C") = ll-f^*!! • \\f\\Hi(C^) = \\H.s,f\\L2(Cmy 

Hence, both inequalities are equalities. The fact that the first inequality turns 
into equality means that ($ -Q)/ G HliC""). The fact that the second inequality 
turns into equality means that f{() is a maximizing vector of ($ — for almost 
all C £ T and 

' 'i($-g)(c)iiM_ii/(c)iic")'rfm(c) 

\ 2/p / ^ \ 2/9 

Jl(<f - Qmrm^JruiOj ^ ||/(C)ll^.^rn(C)j , 

17 



i.e., the corresponding Holder inequality turns into equality, which implies that 
||($ - Q)(C)||m„,„ = c||/(C)||c"'' fo^^ some constant c. Since 

ii(if^/)(C)iic"^ = ii(<f-g)(C)iiM_ii/(c)iic", 

it follows that ||(iy$/)(C)||c- = c||/(C)|lc". Multiplying the maximizing vector / 
by a suitable constant, one can always make the constant c equal to 1, and so we 
may assume that 

\\{H^fm\\c- = \\f{c)rJi 

Let h he a. scalar outer function such that 

\hic)\ = m^fm\\c-, ceT, 

and so 

II(*-Q)(C)I|m. =IMC)P^ Cel. 



Put g = zH^f e iJ2(c™). Then 

HOWc- = \H0\ and ||/(c)||c. = |MC)p/^ CeJ. 

The vector function / admits a factorization / = dih'^^'^v, where 'di is a scalar 
inner function and -y is an n x 1 inner and co-outer function, while the vector 
function g admits a factorization g = d2hw, where ^2 is a scalar inner function 
and w; is an m X 1 inner and co-outer function. 
Let now 

V =[v Q) and VT* = ( t« S ) (4.3) 

be thematic matrix functions (see §2.3). 

Consider the matrix function iy($ — Q)V . Its upper left entry is equal to 

^ = - Q)v = ^2h-^g\^ - Q)hh-^'''f = h^2h-^'''-^g'H^f 
= zdh-'/'-'g'g = zdh-'/'-'lhl^ = zdj^ = z§ 



where d = '&i'&2- 

We have |C(C)| = ll(^ ~ Q)(C)I|m,„,„- Since both V and W are unitary-valued, it 
is easy to see that $ — Q has the form (14.21) . 

To prove the implication (v)^(ii), we need the following lemma. 

Lemma 4.5. Suppose that ^ is a matrix function that admits a factorization 

( zmiih^i'i \ 

^ = W*\ \V\ (4.4) 

where 'd, h, V, and W are as in the statement of Theorem 14.41 Then $ is 
p-badly approximable. 
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Proof. As we have already observed, for an arbitrary matrix function $ in 
LP(Mm,n) the following inequalities hold: 

\\H4 < distiP (<I>,/JP(M„,„)) < ||$||//P(M„,„). 

It suffices to prove that if $ is as in fl4.4p . then ||-f^$|| > ||'^'||hp(m„,„)- Consider the 
matrix functions V and W: 

V = {v 0) and W = {w E) 
Let / = h'^^'^v. It is easy to verify that 

H^f = $/ = zMw and \\H^f\\mc^) = ll'^'lUnM^.n) ll/IU^C") 
which implies that ||-f^<i.|| > ||*&||lp(m™,„)- ■ 

(v)^(ii). Let R = ^ — Q. For £ > we consider the function defined by 

R, = W*i ]V*. 
V ^# J 

By Lemma 14.51 R and Rs are p-badly approximable matrix functions. We define 
the function $e by $e = i?^ + Q. 

Since Q G H^iMm^n) and Rs is p-badly approximable, it follows that Q is a 
p-best approximant to $£. Clearly, for C ^ T, the space of maximizing vectors of 
ReiC) is one-dimensional, and so is a regularly approximable matrix function. 
The result follows from the obvious fact that 

||$e-$||LP^O as e ^ oo. 

To show that (iii)^(iv), we observe that (iii) implies that each 2x2 minor of 
^ vanishes on a set of positive measure. By the uniqueness theorem for the Hardy 
classes, it follows that all 2 x 2 minors of \1/ are zero almost everywhere on T which 
proves (iv). 

Let us prove now that (ii)^(i). Clearly, it suffices to show that if $ is regularly 
approximable, then \\H^\\ = dist^p ($, ifP(Mm,n)) • Let Q be a matrix function 
in ifP(Mm,n) such that the space of maximizing vectors of ($ — Q){C) is one- 
dimensional on a subset of T of positive measure. Let \1/ be a dual extremal 
function of $. It follows easily from (14.11) that 

trace (($ - g)(C)^(C)) = ll($ - Q)(C)||m_||*(C)I|5-, C e T. (4.5) 
We need the following elementary lemma. 

Lemma 4.6. Let A G Mm,n o,nd B G M„ „i he matrices satisfying 

|trace(v45)| = \\A\\m^J\B\\s--^. 

Assume that the space of maximizing vectors A is one- dimensional. Then B has 
rank 1. 

19 



Proof. Without loss of generality we may assume that m = n. By considering 
the polar decomposition of B, we may assume that B is positive, i.e., {Bx, x) > 
for every vector x. Let ei, ■ ■ • , e„ be an orthonormal basis of eigenvectors of B and 



let Bcj = IjCj. We have 



trace(v45)| = |trace(5A)| 



J2iBA^ 



Y^{Ae„Be,] 



On the other hand. 



\\A\\.\\B\\s^^ = \\A\\.J2iBe„e,) = \\A\\J2lr 

j=i i=i 

It follows that if ||Aej|| < \\A\\, then Ij = 0. By the hypotheses there can be only 
one j, for which ||^ej|| = \\A\\, which proves the result. ■ 

It follows from (14.51) and from Lemma [4.61 that $ satisfies (iii). Since we have 
already proved that (iii)^(iv) and (iv)^(i), it follows that $ satisfies (i). 

The fact that (iv)^(iii) is obvious. It remains to prove that (v)^(iv). Suppose 
that $ — Q is factorized as in (14.21) . Without loss of generality we may assume 
that 11$ — Q II LP = 1 Define the matrix function ^ by 



^ = ^^9/^1+2/^ 0) ("^ 



where v and w are as in (14. 3p . 

Clearly, rank^(C) = 1, C e T. We have 



i^ir , 

I iIlp'(s';'™) 



and 



j trace (($ - Q)^) dm = j z^^+^Z" trace j ('^ ~ ^) j 



trace 



>|2 0^ 





dm = \\h\\l2 = 11$ - QWlp = 1- 



This completes the proof. ■ 

Remark. Note that in the case of analytic matrix approximation in the L°° 
norm it is not true that for an arbitrary matrix function $ G L°°(Mm,n) there exists 
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a dual extremal function in Hq{S^'"^). Moreover, it was shown in [Pe2] that a dual 
extremal function exists if and only if the Hankel operator : H'^{C"') —>■ ifi(C™) 
has a maximizing vector. 

However, in the case p = cx), if a dual extremal function exists, then there exists 
a dual extremal function \E' such that rank\Ef(C) = 1 almost everywhere on T, see 



5. Weird matrix functions 



The main result of this section is a construction of a weird matrix function of 
size 2x2. 

Lemma 5.1. There exists a bounded 2x2 matrix function B such that B* = B, 
trace = 1, C ^ TT, the eigenvalues of B{(), ( eT, are positive and separated 
away from zero and there is no constant self-adjoint matrix C such that 

rankC = 1 and trace i?(C)C =1, C ^ "T- 

Proof. Let a be a real bounded scalar functions, (3 a complex scalar bounded 
function such that the functions a, /5, /?, and 1 are linearly independent, and the 
function a{l — a) — is positive and separated away from zero. Put 

B = 

Clearly, B* = B, the eigenvalues of 5(C), C ^ T, are positive and separated away 
from zero, and trace 5(C) = 1, C ^ I'- Suppose that C is a self-adjoint constant 
matrix such that rankC = 1, and trace i?(C)C = 1, C ^ T. Then C has the form 




C 



a b 

b a->|2 

where a is a nonzero real number and 6 is a complex number. We have 

trace i?(C)C = aa(C) + 6^(C) + &/?(C) + a~'\b\\l - a(C)) = 1, C e T. 

Thus 

(a - a-'\b\')a{0 + b^ + mO + a-'\b\' -1 = 0. 

Since the functions a, /3, /3, and 1 are linearly independent, this equality is impos- 
sible. ■ 

Consider the Wiener-Masani factorization of B (see |WM] ) : 



B = 
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(5.1) 



where \1/ is an invertible bounded analytic function in D. Put 

A = 

and consider the Wiener-Masani factorization of A^: 

= QQ*. 

Let U be the matrix function defined by 

U = zQ-^A. (5.2) 
Then [/ is a unitary-valued function on T: 

U*U = A{Q*)-^Q-^A = I. 

Clearly, 

AU-^ = zQe H^(M2,2)- 
Let us show that U is p-badly approximable. 

Lemma 5.2. Let A be a self-adjoint 2x2 matrix function such that 

traceA(C) = l, C e T, 

and the eigenvalues of A{Q are positive and separated away from zero. Suppose 
that U is a unitary-valued matrix function on T such that AU~^ G H^{Wl2.2) ■ 
Then U is a p-badly approximable matrix function. 

Proof. Let F G Hp{M2,2)- For C G T, we have 

|trace((f/-F)Af/*)(C))| < \\{U - F)AU*){0\\s, < WiU ~ F){0\\m,^,. 
Thus by Holder's inequality, 

i/p / /■ \ i/p 



\U-F\ 



LP {M2.2) 



J \\U-F\\Pdmj > (^J \tTace{{U - F)AU*)\'' dm 

> I jtrace {{U - F)AU*) \ dm > [ trace (([/ - F)AU*) dm 

/ trace (f/Af/* )c;m- / trace {F AU*) dm 
Jt Jt 



trace (UAU*) dm 



trace A dm 



1. 



Note that 



/ trace (FAU*) dm = 0, 
Jt 



since FAU* G Hq{M.2^2)- Thus U is p-badly approximable. ■ 

To prove that the matrix function U defined by (15.21) is weird, we need the 
following lemma. 
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Lemma 5.3. Let A be a bounded positive definite matrix function on T whose 
inverse is also bounded and let A = \[^\[^*, where \l/ is an invertible matrix function 
in H°° . A matrix function F in H°° satisfies the equation 

AF* = FA (5.3) 

if and only if 

F = ^C^-\ (5.4) 
where C is a constant self-adjoint matrix. 

Proof. Put 

C = ^-^F^. 

Then C is an matrix function and (15. 4p holds. By ( 15.31) . we have 

AF* = A{<il*y^C*'^* = FA = ^HC^-^A. 
Since A = we obtain 

v|>vi>*(\i/*) -1(7*11/* = \i/(7\i/-i\[/vi>* 

which implies C = C*. Since C is an H°° matrix function, it must be constant. 

Clearly, if C is a constant self-adjoint matrix and F is defined by (15. 4p . then F 
satisfies equation (15.31) . ■ 

Theorem 5.4. The matrix function U defined by (15.21) is a weird p-badly ap- 
proximable function. 

Proof. Assume that U is respectable. By Lemma 15.21 U is p-badly approx- 
imate. Then \\Hu\\ = ||^7||lp(M2,2) = 1- 

Let / G if'^(C^) be a maximizing vector of iff/ of norm 1. We have 

1 = WHufWmc^) = ||P_t//|U2(c2) < ||t//|U2(c2) = WfWmc^) < ||/||l,(c2) = i. 

Thus all inequalities in this chain of inequalities are equalities. The equality 
II/I|l2(c2) = ||/||l9(c2) means that ||/(C)||c2 = 1, C e T, while the equality 
||P_f//|U2(c2) = ||f//||L2(c2) means that Uf G H^C^), and so Uff* e H^{M2,2) 
or, in other words, f f*U~^ G H^{M2,2)- Put 

F = ff*A-\ 

Then F satisfies (15. 3p . Hence, by Lemma [5.31 F has the form F = \E'C\E'^^, where 
C is a constant self-adjoint matrix. Since F has rank one on T, it follows that 
rankC = 1. Clearly, 

//* = FA = ^C^-i^^* = ^C^*. 
Let B = v[f*\I/ be the matrix function obtained in Lemma [5.11 By (15. ID . we have 

trace = trace \E'*\I^C = trace vPC^* = trace//* = L 
This contradicts Lemma [5.11 ■ 
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Remark. The results of Sections 4 and 5 show that the class LP(Mm,n) splits 
into two subsets. The first subset consists of respectable matrix functions and for 
respectable matrix functions $ the distance dist^p ($, iJ^(Mm,n)) can be computed 
by formula fll.2l) . The second subset consists of weird matrix functions and for 
weird matrix functions $ we have to find another formula to compute the distance 
dist^p ($, -ff^(Mm,n)). Such a formula will be obtained in the next section. 

Let us explain that in a sense both the set of respectable matrix functions and 
the set of weird matrix functions are massive subsets of L'P{Wlm,n)- First of all, the 
set of weird matrix functions is open and nonempty, as we have just seen. 

Secondly, if $ G L'P{Wlm,n) and Q is an arbitrary function in if°°(Mm,n); then $ is 
respectable if and only if Q is. Thus to characterize the set of respectable matrix 
functions, we can restrict ourselves to the case of p-badly approximable respectable 
matrix functions. It is easy to see that the set of respectable badly approximable 
matrix functions has nonempty interior in the set of p-badly approximable matrix 
functions. Indeed, it is easy to verify that the p badly approximable matrix function 

* = (o o) (S'S) 

belongs to the interior of the set of respectable p-badly approximable functions. 

However, we do not know whether the set of respectable matrix functions has 
nonempty interior in the space L^(Mm,ri). In particular, we do not know whether 
the matrix function $ defined in (15.51) belongs to the interior of the set of re- 
spectable matrix functions has nonempty interior in the space LP(M.m,n)- 



6. Hankel operators on spaces of matrix- valued functions 

We have already mentioned in the introduction that the problem of analytic 
approximation of matrix functions can be reduced to the case of square matrix 
functions and beginning this section we assume that $ G L^(M„). 

For $ G L^(M„), we consider the Hankel operator defined on the space 
if "(5^) to the space H^{S^) defined by 

H^F = P_$F, 

where P_ is an orthogonal projection from the space L?'{S^) onto the subspace 
if2(5^) ''^ L^{Sl)QH^{Sl). 

Theorem 6.1. Let $ G Lp(M„). Then 

\\H<i>\\m(s?,)^Hl(s-,) = distiP (<l>,iJP(M„)). 
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Proof. Suppose that $ G Lp(M„). Then for F G H'^iS^) and Q G ifP(M„), we 
have 

< 11$ — g||LP(M„) ||-^||/i'9(S^) 

by Holder's inequahty. Thus \\H^\\ < dist^p ($,ifP(M„)). 

To prove the opposite inequahty, we are going to use Theorem 13.11 that has been 
deduced from Sarason's theorem. Let \l/ be a dual extremal function of i.e., be- 
longs to Hq (M„) and satisfies (14. ip . By Theorem 13. 11 there exist matrix functions 
F G Hi(Mn) and G G H^(Mn) such that 

^ = FG and \\F\\l,(^s^)\\G\\l2^s^) = 1. 

Without loss of generality we may assume that ||-F||l9(s5) = 1 and ||G'||l2(S5) = 1- 
We have 



lif^ll > \(H^F,G* 



T 



trace [{H^F)G) dm 
trace dm 



T 



lL2(S5)| 



trace($FG') dm 
= distLP ($, //P(M^,„)) 

by gH). ■ 

It follows immediately from Theorem 16.11 that 

\\H^\\hi{C")^H'^{C") ^ \\H^\\Hi{Sr^)^Hl(Sr^)- 

Note that this inequality can also be obtained easily from the definitions of 
and i3"$. 

Theorem 6.2. Let $ G L^(M„) and let Q be a best approximant to $ in 
if^(M„). r/ien the following assertions hold: 

(i) G H''{Mn) IS a maximizing vector of H^, then ($ - Q)F G H'i^S^); 

(ii) t/ie function 

Ch-.||($-Q)(C)||m„ (6.1) 

does not depend on the choice of a best approximant Q; 

(iii) if H(S> has a maximizing vector F such that rank F{() = n on a subset ofT 
of positive measure, then $ has a unique best approximant in iJ^(M„); 

(iv) if Fi and F2 are maximizing vectors of Hi^, then 

||Fl(C)l|5^'=c||F2(C))||5? 

for some positive constant c; 
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(v) if Q is a best approximant to $ in ifP(M„) and F is a maximizing vector of 
Hq>, then the matrix 

1 



-($-g)(C), CeT, 



II($-Q)(C)IIm. 

is isometric on the range of F{(). 

Proof. Let us fix a maximizing vector F of We liave by Holder's inequality, 

\\H^F\\l2^s",) = ||P-(($-Q)F)|U2(s.) < ||($-Q)F|U2(sn) (6.2) 

< ll'^' - <5||lp(M„)||-^||//9(S^) = \\H^\\Hi{S^)^H^{S'i)\\F\\HiiSll)- 

Since \\H^F\\l2(^s^) = ll-f^^ll ' it follows that both inequalities in (I6.2p 

are equalities. 

The fact that the first inequality in (16.21) turns into equality means that 
(<l>-Q)FGi/2(5-), i.e., 

($-g)F = /f$F (6.3) 

which proves (i). 

To prove (iii), we observe that since F G ^^{S^), it follows that if iai\kF{Q = n 
on a set of positive measure, then rankF((^) = n, ( & T, almost everywhere on T. 
Hence, 

^-Q = iH^F)F-\ 

and so Q is uniquely determined by $. 

The fact that the second inequality in (16. 2p turns into equality means that there 
exists c > such that 

m-QmrM^ = c\\F{o\\%,., c^t, (6.4) 

and 

IK* - QKOFiOWs^ = m- Q)(C)IImJ|f(C)IIs5, C e t. (6.5) 

Clearly, (iv) follows immediately from ( 16. 4p . 

If we normalize the maximizing vector F by the condition 

= ||f/^r, (6.6) 

then integrating (16. 4p . we obtain 

= 11"^ ~ ^IIlp{m„) ^ '^II-^IIl9{S2)- 

Hence, under condition f l6.6p . 

m-QmrM,. = \\F{cWs^, c^t, (6.?) 

and so the function (16. ip is uniquely determined by $. This proves (ii). 
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It remains to observe that (v) follows from (16.51) and from the fact that for nxn 
matrices A and B the equality 

M-Bllsj = ll^ll ■ ||-B||sj 

holds if and only if the restriction of A to the range of -B is a multiple of an 
isometry. ■ 

Definition. For a function $ G L^(M„), the function (16. ip is called the distance 
function of $. We denote the distance function of $ by d$: 

d$(C) = ||($-g)(C)l|M„, CeT, (6.8) 

where Q is an arbitrary best approximant to 

The following result describes the set of all nonzero distance functions of matrix 
functions in L^'(M„). 

Theorem 6.3. Let d > be a nonzero function in L^. Then d is the distance 
function of a matrix function $ G L*'(M„) if and only if log d G . 

Proof. If $ G LP(Mn) \ HP(Mn) and Q is a p-best approximant to $ and 
d{C) = ||($ — Q)(C)||m„, the fact that logrf G follows immediately from (16.41) . 

The converse follows from Corollary 12.21 by considering matrix functions of the 
form 

■■■ 0\ 
■ ■ 

. • ■ 

\0 ■■■ 0/ 

Theorem 6.4. $ G Lp(M„) \iJP(M„). Then $ is respectable if and only if there 
exists a maximizing vector F of H<j, such that 

rankF(C) = l, C e T. (6.9) 

Proof. Suppose that $ is respectable. Consider the Hankel operator 

Let / G H'^{C^) be a maximizing vector of Define the matrix function 

F G Hi{Mn) by 

F={f ■•• ). 
It is obvious that Ta.nkF{() = 1 for C G T. Clearly, 

l|i^(C)l|5. = ||/(C)l|c" and ||(if^F)(C)||55 = ||(i/*/)(C)||c", 
and so F is a maximizing vector of 
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To prove the converse, we may assume that \\H^\\ = 1. Suppose that F is a max- 
imizing vector of of norm 1 that satisfies fl6.9l) . Let Q be a best approximant 
to $ in HP{Mn). By Theorem [621 we have ($ - Q)F e /f^(M„). Put 

G'= ^^(($-Q)Fy and ^ = FG e H^' (Mr,). 

Clearly, 

rank^(C) = l, (eT, and < 1- 

Let us show that \l/ is a dual extremal function of $. Assuming that (I6.6p holds, 
we have by (16. 7p . 

/ trace($\[^) dm = / trace (($ - Q)FG) dm 
Jt Jt 



j trace (^P_(($ - Q)F)G^ dm 
! trace {{H^F)G) dm = {H^F,G*)L2is^) 



Thus \E' is a dual extremal function of rank 1, and so by Theorem I4.4[ $ is re- 
spectable. ■ 

Note that the computation, in fact, shows that = 1- 

Corollary 6.5. Let ^ be a weird function m Lp{M2) \ H'p{M2). Then $ has a 
unique best approximant in Hp(M.2). 



Proof. By Theorem 16. 2[ if $ has more than one best approximant, then each 
maximizing vector F of has rank 1 almost everywhere on T. By Theorem 16. 4[ 
the function $ is respectable. ■ 

We consider now for a function $ G Lp(M„), the family of Hankel operators 

i/f ^ : Hl{S:^'^), l<k<n, defined by 

Hi^^F = P_<I>F, F G H\Sl'^). 

Clearly, H^^^ = and //|"^ = H^. 

Theorem 6.6. Let $ G Lp(M„) \ /7p(M„) and letl<k<n. The following are 
equivalent: 

(i) there exists a maximizing vector F of such that 

rankF(C)<A;, C e T; (6.10) 
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(ii) the following distance formula holds: 



H. 



{k} 



dist^P ($,i/P(M„)). 



Note that a standard argument with analyticity properties of minors shows that 
rankF((^) is constant for almost all ^ in T. 

Proof of Theorem 16.61 Suppose that (ii) holds. Let G G H'^{S2'^) be a 
maximizing vector of H^'' (observe that H'^^'> is compact). Consider the matrix 
function F G H'^{S^) obtained from G by adding n — k zero columns. Clearly. 



\H^F\ 



distz^P (<l>,i/^(M„))||F|| 



Thus F is a maximizing vector of that satisfies (16.101) . 
Suppose now that F is a maximizing vector of i3"$ such that 

rankF(C) = A;, C e T. 

Without loss of generality we may assume that ||F||Lq(sn) = 1. As in the proof 
of Theorem 16.41 consider a best approximant Q to $ in if^(M„) and define the 
matrix functions G and \1/ as in that proof. Then 

rank*(C) = /c, (eT, and = 1. 

The fact that \1/ is a dual extremal function of $ can be verified as in the proof of 
Theorem 16.41 



By Theorem 13. 4[ \1/ admits a factorization \1/ 
Gb G H^(M,, „) and ||^ 



FbGb, where Fb G H" 



rb t nQyLmk,n) ana || ||ip'(sn) 



We claim that 



H. 



{k} 



Fb||^,(gn,fcj||G'b||^2(5fc.")- 



and Fb is a maximizing vector of H^''\ This 



can be proved in the same way as in the proof of Theorem 16. 1[ Indeed, without 
loss of generality we may assume that ||Fb 



1 and II ^b II ^2^^^^'"^ 



1. Then 



> 



{k} 



> 



HtF,,G; 



J trace (^(iff ^Fb)Gb) dm 



trace (P_(<l'Fb), Gb) dm 



trace($Fb(ji'b) dm 



trace {^^) dm = distil ($, i/^(M„)) 



by (14.11) and Theorem 16.11 ■ 

Definition. A matrix function $ G L^(Mn) is said to have order k ii k is the 
smallest number such that 



distiP ($, HP{Mn)) 
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H. 



{k} 



Clearly, a matrix function $ is respectable if and only if it has order 1. 
The reasoning given in the proof of Theorem 16.61 allows us to obtain the following 
formulae for the order of a matrix function in L^. 

Theorem 6.7. Let ^ be a matrix function in Lp(M„). Then the following 
assertion hold: 

(i) the order of $ is the minimal number k, for which there exists a maximizing 
vector F of Hq, that satisfies fl6.10p . 

(ii) the order of $ is the minimal number k such that $ has a dual extremal 
function satisfying 

rank^(C)<A;, (eT. 

Proof. It is easy to see that the proof of Theorem 16.71 is contained in the proof 
of Theorem 16.61 ■ 

In §7 we obtain one more formula for the order of $, see Theorem 17. 2[ 

We can obtain now an analog of Theorem 16.21 for the Hankel operators H^^\ 

Theorem 6.8. Let ^ be a matrix function in L'P{M.n) such that 

dist^p {^,HP{Mn)) = 

Then the following assertions hold: 

(i) z/F e Hi{Mr,^k) IS a maximizing vector of H^^ , then {^-Q)F G H^^{Sl'^); 

(ii) if Fi and F2 are maximizing vectors of then 

for some positive constant c; 

(iii) if Q is a best approximant to $ in iJ*'(M.„) and F is a maximizing vector 
of then the matrix 

is isometric on the range of F{(). 

Theorem 16.81 can be proved in the same way as Theorem 16. 2[ 

Remark. Note that in the case p = 00 and k = 1, (ii) is very far from being 
true. Indeed, we can take two different scalar outer functions hi and /12 in and 
consider the matrix function $ defined by 
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It is easy to see that $ is badly approximable, ||-f^<i.||H2(c2)->H2 (c2) = 1, and the 
vector functions 

- Co) - it 

are maximizing vectors of though the functions 

C^II/i(C)l|c2 = |/^i(C)l and C^||/2(C)l|c2 = 1/^2(01 
do not have to be proportional 

7. p-badly approximable functions 

In this section we characterize the set of all badly approximable functions in 
terms of certain special factorizations. Such factorizations allow us in this section 
to obtain a parametrization of all p-best approximants to a given matrix function 
in L^iWln) in the case when such a best approximant is not unique. 

To describe the set of p-badly approximable matrix functions, we prove the 
following result that can be considered as an analog of the corresponding result for 
analytic approximation in the L°° norm, see [Pel] , Ch. 14, § 15. 

Theorem 7.1. Let $ G Lp(M„,) and let Q be a best approximant to $ in 
H^{Wln). Then $ — Q admits the following factorization 

$-g = w*(^ (7-1) 

where V and are k-balanced matrix functions for some k < n, A is a 
k X k p-badly approximable matrix function such that the matrix function d^^'^A is 
unitary-valued, and is a matrix function such that 

||$#(C)||m„_, < ||A(C)lk, CGT. 

Proof. Clearly, without loss of generality we may assume that Q = 0, i.e., $ is 
a p-badly approximable matrix function. 

Suppose that H,^ has a maximizing vector of rank k. In the proof of Theorem l6.6l 



we have shown that 



{k} 



distiP H^iWln)) and there exists a maximizing 



vector F e H'i{Mn,k) of hI^^ such that rankF(C) = k, ( e T. Consider the 
inner-outer factorization of F*: 



F' = 0\Fl 



Then 



F = FM 

It is easy to see that Oi is an inner matrix function of size k x k and F^o is a 
co-outer matrix function of size n x k. It follows easily from (i) of Theorem 16.81 
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that Fco is a maximizing vector of . Without loss of generahty we may thus 
assume that F is co-outer. 

Let G be the function in if^(M„ fc) defined by 



G{0 = C{Hi'^F)iC). 



By (i) and (iii) of Theorem 16.81 we know that G = z^F has rank k on T. 

Similarly, we can consider the inner-outer factorization of and obtain a fac- 
torization 

G = Gco02i 

where O2 is an inner matrix function of size k x k and Geo is a co-outer matrix 
function. 

Consider now the inner-outer factorization of F 

F = TFo. 

Since rankF((^) = k almost everywhere on T, it is easy to see that T has size nxk. 
Similarly, we can consider the inner-outer factorization of Geo: 

Geo = 0Go 

and has size nxk. 

We can consider now balanced completions V and W of T and 0: 

V = ( T ) and = ( S ) , (7.2) 

where and S are inner and co-outer matrix functions such that the matrix 
functions V and W defined by (17. 2p are unitary- valued (see §2). 
Let 

A = W$V. 
By Theorem EJl H^^^F = <I>F, and so 

$F = W*AV*F = W*A Qt ^ TFo = W*A (^^^^ =zG = zGM- 

Thus 



Clearly, P(C)||m„ = ||$(C)I|m„, C e T, and by Theorem O (see dHS])), ^"jf^ 
is a maximizing vector of A{() for almost all C ^ T. Let 

A = 

where An has size k x k. 



All Au 
A21 A22 
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By Theorem 16.81 the matrices ||74(C)|| ^Aii{() take unitary values almost ev- 
erywhere on T. It is easy to verify (see e.g., |Pelj . Lemma 15.5 of Ch. 14) that 
A21 = 0, A,2 = 0, and 11^22(011 < Pii(C)l|, C e T. 

Clearly, \\A{C)\\ = d<i,(C). Put A = An- Then (d<i,)-iA is a unitary-valued 
matrix function and 

* = '^"(o *,)^'- 

where $# = A22. Obviously, ||$#(C)|| < ci$(C), C eJ. 

It is easy to see that H^Fq = ZG0O2, and so ||-fi"A|| = ll-f^*!!, which implies 
that A is a p-badly approximable matrix function. ■ 

Remark 1. Note that the matrix function A is determined by the choice of a 
maximizing vector and it does not depend on the choice of a p-best approximant 
Q. It is also clear that the /c-balanced matrix functions V and W do not depend 
on the choice of Q either. 

Remark 2. Clearly, we can always take k to be the order of $. However, 
the choice of k is not always unique. For example, if G is a scalar p-badly 
approximable function and / G if is a maximizing vector of i?^, then it is easy 

to see that "^"^q ^^^^^ respectable p-badly approximable matrix function 

and the matrix function -^~^q j^isa maximizing vector of = 

Thus the matrix function $ admits factorizations of the form 17.11 with k = 1 and 
k = 2. 

Definition. We say that a matrix function $ G L^(M„) has gender k ii k is 
the maximal number such that has a maximizing vector of rank k. Clearly, in 
Theorem 17.11 we can take k to be the gender of $. 

Factorizations of the form fl7.1l) allow us to obtain one more formula for the 
order of matrix functions in L^. 

Theorem 7.2. Let ^ be a matrix function in L^{M.n) and let Q is a p-best 
approximant to $. Then the order of ^ is the minimal number k such that $ — Q 
admits a factorization as in (17. ip with k-balanced matrix functions V and W*. 

Proof. The proof of Theorem 17.11 shows that if k is the order of then $ — Q 
admits a factorization of the form (17. ip with fc-balanced matrix functions V and 

Suppose now that (17. ip holds with /c-balanced matrix functions V and W*. Sup- 
pose that V and W are given by (17. 2p . 
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Let G e ^^"(5^) be a maximizing vector of i?A- Consider the matrix function 
F e Hi^S""/) defined by 

F = TG. 

We have 



;* - Q)F = W ('^ (^ri TG 



- 4) (o) ^ ("o''' 



Since G is a maximizing vector of H and A is a p-badly approximable matrix 
function, it follows from Theorem 16.21 that AG = H/^G, and so 

($ - Q)F = 0HaG. 

It is easy to see that F is a maximizng vector of and ||-ff<i>|| = ||//|'^"'^||. This 
proves the result. ■ 

Remark. If $ — Q admits a factorization as in (17. ip with /c-balanced V and 
W, where k is the order of $, and L is a dual extremal function of A, then a dual 
extremal function \E' of $ satisfying the condition 

rank^(C) = A;, (eT, 

can be obtained by the following explicit formula: 

^ = TL0*. 

Indeed, we may assume without loss of generality that $ is p-badly approximable 
and Q = 0. We have 

II^IIlp'(s?i') = II^^IIlp'(s*:) = 1- 

Clearly, 

trace ($ (C)^(C))) = trace (0*(C)$(C)T(C)L(C)) 



trace (a(C)L(C)). 



Thus 

'trace($(C)^(C))c^m(C)= / trace (A(C)L(C)) f/m(C) 



|A||LP(Mfc) — ||'^'||lp(M„)- 
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Let us now describe all p-badly approximable matrix functions. Note that similar 
results hold in the case p = oo under certain restrictions on the function, see |Pelj . 
Ch. 14, § 15. 

Theorem 7.3. Let $ be matrix function in Lp(M„). Then $ is p-badly approx- 
imable if and only if there exists k < n such that $ admits a factorization 



A 



$ 



# 



V*, 



(7.3) 



where V and W* are k-balanced matrix functions, A is a kxk p-badly approximable 
matrix function such that the matrix function dJ^A is unitary-valued, and is 
a matrix function such that ||$#(C)||M„_fc < ll^(C)l|Mfe for almost all C G T. 

Proof. The fact that p-badly approximable matrix functions $ admit factor- 
izations of the form (17.31) follows immediately from Theorem 17.11 

Suppose now that $ is given by (17. 3p . Consider the Hankel operator 
Ha : H''{S^2) Hl{Sl). Let G Hi{Sl) be a maximizing vector of H a. 
Since A is p badly approximable, it follows from Theorems 16.11 and 16.21 that 



Hi.T = AT and \H kT 



l$l 



Lf(M,0 



\T\ 



Consider the matrix function F = TJF, where T is as in (17. 2p . We have 







(o 


$# ) ( 






(o 


) ( 







(0 E) 



AJ^ 




Thus 



Hi'^^F 



l$l 



LP {ft 



= 0AJP- E HliS 

-H'a^||l2(s§) 
11$ 



n,k\ 



LP{U„)\ 



It follows that 



H. 



{k} 



||*^'||lp(m„), and so $ is p-badly approximable. ■ 
The next result allows us to parametrize all best approximants in the case when 
there are more than one best approximant. A similar result also holds in the case 
of approximation in the norm of L°° under certain restrictions on $, see [Pel] . Ch. 
14, 515. 
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Theorem 7.4. Let $ and Q be as in Theorem U .1\ and let ^ — Q be factorizred 
as in fl7.ip . A matrix function R G H'P{M.n) is a p-best approximant to $ if and 
only if there exists a matrix function G H^(Mn-k) such that 

$ - = W* ( ^ ^ IV* (7 A) 

\ ^# ~ R# J ' ^ ' 

and 

||($#-/?#)(C)I|m„., <||A(C)||m„ CgT. (7.5) 

We need the foUowing lemma. 

Lemma 7.5. Let V and W* be k-balanced matrix functions of size nx n. Then 

W«vn(° i.,M„_,)) = (o i7^(M.._.))' 

For p = oo this is Theorem 1.8 of Ch. 14 of [Pel] . The proof given in [Pel] also 
works in our case. 

Proof of Theorem 17.41 Suppose that i? is a best approximant to $. Then by 
Theorem I7.H $ — i? admits a factorization 

where A, V, and W are as in fl7.ip and $^ is a matrix function such that 
||$b(C)||M„_, <||A(OI|M„CeT. Then 

i?-Q = n;*(^ . ^ . ]v*. 

By Lemma ESI R# = $# - $b G HP{Mn-k). 

Conversely, suppose that R^ is a matrix function in if^(M„_fc) such that (17.50 
holds. Then by Lemma [7.51 there exists R G if^(M„) such that 

Then (17.40 holds. It follows easily from Theorem 17.31 that i? is a p-best analytic 
approximant to ■ 

Theorem 17.41 says that to describe all p best approximants, we should describe 
all functions R^ G H^iMn-k) such that (17.51) holds. By Theorem 16.31 there exists 
a scalar outer function x in such that |x((^)| = ||A(^)||Mfc = d$(C), C ^ I"- 
Clearly, a matrix function R^ in i7^(M„_fc) satisfies (17.51) if and only if the matrix 
function satisfies the inequality 

||x"-^$# - x'^R^Wl^ < 1. 
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In other words, this reduces the problem of the description of all p-best approxi- 
mants to the problem of describing all matrix functions Q in H°°{WS.n-k.n-k) such 
that 

||x"^$# - QIIloo < 1. (7.6) 

Note that the problem to describe all H°° matrix function Q satisfying (17.61) 
is the classical Nehari problem and in the case of nonuniqueness there is formula 
parametrizing all solutions. It was obtained by Adamyan, Arov, and Krein in 
[AAKlj and |AAK2j under certain assumptions and by Kheifets |Khe] in the most 
general case; see also Ch. 5 of [Pel] . 



8. p-superoptimal approximation 

In this section we introduce the notion of p-superoptimal approximation and 
prove that if $ is a rational matrix function then $ has a unique p-superoptimal 
approximant. 

Definition. Let $ G Lp(M„) \ ifP(M„). For a function Q G i/P(M„), we define 
the numbers Tj(<I>, Q), < j < n — 1, by 

Tj{^,Q) = ess sup — — . 

Ceir d$(g 

A function Q G iJ^(M„) is called a p-superoptimal approximant to $ if it minimizes 
lexicographically the sequence Tj($, Q), < j < n — 1. 
If Q is a p-superoptimal approximant to $, we put 

r,($)'=^^,(<|.,Q), 0<j<n-l. 

Clearly, if Q is a best analytic approximant to $ in L^(Mm „), then ro($, Q) = 1. 
It is also clear that if F is a p-superoptimal approximant, then F is a best analytic 
approximant in L^{Wlm,n)- 

It is easy to see that if $ has gender k, then 

Tj{^) = 1 for J = 0,--- ,k-l. 

In this section we are going to work with rational matrix functions. When we 
say that a matrix function defined on the unit circle T is rational, we mean that 
it is a restriction of a rational matrix function to the unit circle. It is easy to see 
that if A is a rational matrix function, then its adjoint A* is also a rational matrix 
function. 

Suppose now that $ is a rational matrix function of size n x n with no poles in 
T and k is the gender of As in the proof of Theorem 17. 11 we consider a co-outer 
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maximizing vector F of the Hankel operator , the matrix function G defined 

by G = zH^^ the factorizations 

G = G,,0, F = TF,, and G,, = 0Go, 

where O is an inner matrix function of size k x k, T and are inner and co-outer 
matrix functions of size n x k, Geo is a co-outer matrix functions of size n x k, and 
Fo and Go are outer matrix functions of size k x k. We also assume that the k- 
balanced matrix functions V and W* are given by (17.21) . Q is a p-best approximant 
to $ and $ — Q is factorized as in fl7.3p . Finally, we assume that F is normalized 
so that dEZD holds. 

Lemma 8.1. Let $ fee a rational matrix function in L^(M„). Then the matrix 
functions O, F, G, V, and W are also rational. 

Proof. If $ is rational, it is easy to see that H^'^^A is rational for an arbitrary 
function A G ^^{S^'^). In particular, this is true for the function H^^^F, and so 
G is rational. 

Let us show that O is rational. It is well known (see e.g., [Pel] , Ch. 2, §5) that 
a square inner function U is rational if and only if the subspace 

Ku = H\d')QUH\C^) = UHl{e')^H\C^) 

is finite-dimensional. Since G is rational, the Hankel operator 

has finite rank (see e.g., [Pelj . Ch. 2, §,5). It is easy to see that for / G Kq^-, 

Hcf = Gf. 

Since rankG(C) = k almost everywhere on T, it follows that multiplication by G 
has trivial kernel. Thus Kq^ is finite-dimensional, and so (9* is rational. Thus O 
is rational, and so Geo = GO* is also rational. 

To prove that the matrix function Go is rational, we observe that 

CO CO '-^o O' 

and so G^Gq is a rational matrix function. The rationality of Go follows now from 
the following well-known fact (see |Y]): if Q be a matrix outer function of class 
if^(Mfe), then Q is rational if and only if Q*Q is rational. 

We have = GcoG^ ^, and so is rational. By Lemma 12.1 of Ch. 14 of |Pelj . 
the matrix function S is rational, and so W is rational. 

Let us show that V is a rational matrix function. Since $ is rational, it follows 
that 
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is a rational matrix function. Thus 



is rational. We have 



and so P_AT* is a rational matrix function. 
Let hhe a scalar outer function such that 

IMC)I = I|G(C)||5..., CeT. 

Then he H^. It follows from O and ([62D that 

l|A(C)||M, = IMC)r/^ and ||F(C)||^.,. = IMOr/^. (8.1) 

Since G is rational, the function is rational. It follows from the result from (Y] 
quoted above that the function h is also rational. 

Let us show that Fo is a maximizing vector of Ha and H^^F = 0AFo. Since 
F = TFo is a maximizing vector of -f^l*^^ , we have by Theorem 16. 8^ 

Hi"F = (* - Q)F = W °^ V-TF, 

= (0 H) (^:) TF„ = e //!(S?'). 

Since the matrix function is co-outer, it follows from Lemma 1.4 of Ch. 14 of 
|Pel] that AFo G H'^{S^). It is easy to see from (18. ip that Fo is a maximizing 

vector of and iff^F = 0AFo. Thus 



AFo = zGoO. 

Consider now the inner-outer factorization of the matrix function GoO: 

GoO = OGo. 



Clearly, both D and Qo are rational matrix functions. Then AFq = zDQo, and so 

A = zWoFo' 

Put 

U = zh-^/PQ'A = go{h'''PFo)-\ 
It is easy to see that U is unitary-valued. Put 

Q = h'"PFo and Q# = Gl 

Since U is unitary-valued, it is easy to verify that 
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Clearly, Q is outer. Since Q^Q*^ is rational, it follows from the result of [YJ quoted 
above that Q = h^^^Fo is rational and U is rational. 
We have A = zh'^^PDU, and so 

is a rational matrix function. Put 

m = F_zOQ*^Q-^T*. 
Let us show that 91 is rational. We have 

and since F.h'^^P^ is rational and h'^^'^ E it follows that F.h^i is rational. 
Since h is rational and = P_1H, it is easy to see that is rational. 

Finally, since the matrix functions zDQ*^Q-^ and F^zDQ*^Q-^T* are rational, 
it is easy to verify that T* is rational. Again, it follows from Lemma 12.1 of Ch. 
14 of jPelj that V is rational. ■ 

To prove the next theorem, we introduce the notation L^,, < a < 1, for the 
class of Holder functions of order a: a function on T is said to belong to the 
Holder class L^ if 

K-t\" 

In the following theorem we keep all the notation as above. 

Theorem 8.2. // $ is a rational matrix function, then A G £2/p(Mfc) and 

Proof. We have 

A = zh^/PDU, 

where OU is a rational function. If h has no zeros on T, then A is infinitely 
different iable. If h has zeros on T, then, obviously, h'^^^ G L2/p, which implies that 
A G L2/p(Mfc). 
Next, since 

'A 



it follows that is a sum of a rational matrix function and an matrix func- 
tion. Thus there exists a finite Blaschke product B such that = B^^y, where 
G HP{Mn-k). We also know that ||$c?(C)||M„_fc < IHOl'^^^- Since h is outer, it 
follows that h-^/P<^^ G H^(M„_k). Thus 

/i-2/p$^ = Bh-'^'P^<y G {H^ + C)(M„_fe). ■ 
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Theorem 8.3. Let $ be annxn rational matrix function. Then $ has a unique 
p-superoptimal approximant Q. Moreover, 



s 



T. 



0<j<n-l (8.2) 



d*(C) 

almost everywhere on T. 

Proof. Let be a best analytic approximant to $ in Lp(M„). By Theorem 
17.11 $ — i? admits a factorization of the form 



where V and W* are fc-balanced matrix function, k is the gender of $, A is a 
p-badly approximable kxk matrix function function such that the matrix function 
dJ^A is unitary-valued, and is a matrix function such that 

||<i>#(C)||m„_, < ||A(C)||m„ CeT. 

It follows from Theorem 17.41 that a matrix function Q G i/*'(M„) is a 
j9-superoptimal approximant to $ if and only if 

where Q# G if^(M„_fc) is a matrix function such that h~'^^^Q# is a superoptimal 
approximant of in L°°. Here h the scalar outer function as in the proof of 

LemmaEH i.e., \h^/P\ = d$. By Theorem[H2l h-^/P^# G H'^+C, and by Theorem 
3.3 of Chapter 14 of [M], h-^/P<^# has a unique superoptimal approximant in the 
L°° norm. 

Formula (18.21) is an immediate consequence of Theorem 3.4 of Chapter 14 of 
jPel] . ■ 

The following example shows that there are matrix functions in L^, for which 
there are different p-superoptimal approximants. 

Example. Let be a scalar L°° function such that 

llty^llioo = distL°°{^p, H°°) = 1, 

and such that there is a nonzero best approximant / G H°° in the norm of L°°. It 
is well known that such functions ip exist (see, e.g., |Pelj . Ch. 1, §1). Consider 
the matrix function $ G L°°(M2) defined by 







It is easy to see that both the zero function and the function ( „ j, 1 are 



p-superoptimal approximants for any p G (2, oo). 
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